Improving speaker de-identification with functional data analysis of f0 trajectories

نویسندگان

چکیده

Due to a constantly increasing amount of speech data that is stored in different types databases, voice privacy has become major concern. To respond such concern, researchers have developed various methods for speaker de-identification. The state-of-the-art solutions utilize deep learning which can be effective but might unavailable or impractical apply for, example, under-resourced languages. Formant modification simpler, yet method de-identification requires no training data. Still, remaining intonational patterns formant-anonymized may contain speaker-dependent cues. This study introduces novel method, which, addition simple formant shifts, manipulates f0 trajectories based on functional analysis. proposed will conceal plausibly identifying pitch characteristics phonetically controllable manner and improve formant-based up 25%.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Functional data analysis of juggling trajectories

Scope The Dutch/Flemish Classification Society, VOC, aims at communicating scientific principles, methods, and applications of ordination and classification. The VOC is a member of the International Federation of Classification Societies (IFCS). Profile Analysis: A complementary way of reporting results in large assessments 14.40 Lieke Voncken Continuous norming of psychological tests: A compar...

متن کامل

Identification of mild cognitive impairment disease using brain functional connectivity and graph analysis in fMRI data

Background: Early diagnosis of patients in the early stages of Alzheimer's, known as mild cognitive impairment, is of great importance in the treatment of this disease. If a patient can be diagnosed at this stage, it is possible to treat or delay Alzheimer's disease. Resting-state functional magnetic resonance imaging (fMRI) is very common in the process of diagnosing Alzheimer's disease. In th...

متن کامل

Joint analysis of f0 and speech rate with Functional Data Analysis

In this work we propose the use of Functional Data Analysis (FDA) as a powerful methodology to tackle problems where multiple continuous speech parameters have to be analyzed jointly. A production study on contrastive focus placement in Neapolitan Italian is used as illustration. Two features are analyzed, viz. f0 and relative speech rate, both expressed as continuous functions of time. The res...

متن کامل

Limited data speaker identification

In this paper, the task of identifying the speaker using limited training and testing data is addressed. Speaker identification system is viewed as four stages namely, analysis, feature extraction, modelling and testing. The speaker identification performance depends on the techniques employed in these stages. As demonstrated by different experiments, in case of limited training and testing dat...

متن کامل

Improving speaker segmentation via speaker identification and text segmentation

Speaker segmentation is an essential part of a speaker diarization system. Common segmentation systems usually miss speaker change points when speakers switch fast. These errors seriously confuse the following speaker clustering step and result in high overall speaker diarization error rates. In this paper two methods are proposed to deal with this problem: The first approach uses speaker ident...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Speech Communication

سال: 2022

ISSN: ['1872-7182', '0167-6393']

DOI: https://doi.org/10.1016/j.specom.2022.03.010